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DETAILED ACTION 
Response to Arguments 

2. Applicant's arguments, which was filed on October, 1 8, 2007, with respect to the 
rejection(s) in regard " wherein the sound sources selected bv the user are processed 
into object sounds and other sounds not selected bv the user are p rocessed into 
background sounds ". However, Pachet et al. did disclosed of the selecting of multiple 
sound by the user and other not selected, wherein the object sounds (selected) , sound 
by the user can be consider as object sound and the background sounds funselected 
sound) , in further consider Pachet et al. one would find multiple sound and wherein the 
options of selecting sound and leaving others the same wherein all sounds must satisfy 
certain constraint and maintain spatial characteristic of the music (see , fig.1; col.4 line 
20-40 & 10-18; col.1 line 48-51; col.2 line 5-10; col.2 line 60-67). 

However, in view of the newlly rejected claim 9, the examiner will write a new 
"non-final rejection". 

Claim Rejections - 35 USC § 103 
2. The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all obviousness 
rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or 

described as set forth in section 1 02 of this title, if the differences between the 

subject matter sought to be patented and the prior art are such that the subject 

matter as a whole would have been obvious at the time the invention was made to a 



Application/Control Number: Page 3 

10/692,769 

Art Unit: 2615 

person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was made. 

3. Claims 1 ,4-6 are rejected under 35 U.S.C. 103(a) as being unpatentable over Fay et al.fUS 
2002/0161462 Ar) and Pachet et al.("US 6,826,28281"). 

Reclaim 1, Fay et al. disclose an object-based three-dimensional (3-D) 
audio server system comprising ( fig. 2-8; page 4 [0046] line 3 - 6 : computer 
program in which 3-D instruction is carried out " ) : an audio input unit 
receiving object-based sound sources through various input devices ( '' fig,2' 
variety audio sources (212) being inputted to input device (204) /page 5 [0056] 
line 1-3 and 8-10 " ) ; an audio editing and producing unit having the sound 
sources applied through the audio input unit into object sounds according to 
a user's selection {'' page 1[0003] line 1-7; page 2 [0024] music with 
associated character to be applied; Fig. 2 (204,206,224) ") , and converting 
them into 3-D audio scene information {"" page 3 [0041] line 6-9-scenes changes 
has corresponding audio representations in a game " ) ; and an audio encoding 
unit encoding 3-D information and object signals of the 3-D audio scene 
information converted by the audio editing and producing unit so as to 
transmit them through a medium ( '' page 2 [0023] line 8-11 " ) . 

However, Fay et al. fail to disclose of the sound editing/producing unit 
separating the sources being applied through the audio input unit into object 
sound and background sounds according to a user's selection, but Pachet et 
al. disclose a system in which an audio editing/producing unit separating the 
sources being applied through the audio input unit into object sound and 
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background sounds according to a user's selection {^'Fig.l (2); col. 2 line 5- 
10; col. 2 line 50-59; col,4 line 13-21; col ,7 /different sounds may be placed 
according with the different music instrument in the same scene by user") for 
the purpose of maintaining the consistency of music upon changing the spatial 
characteristic of the music. Thus, combined the teaching of Fay et al. and 
Pachet al. as a whole, it would have been obvious for one of the ordinary 
skill in the art to modify Fay et al. by incorporating the sound 
editing/producing unit separating the sources being applied through the audio 
input unit into object sound and background sounds according to a user's 
selection for the purpose of maintaining the consistency of music upon 
changing the spatial characteristic of the music. 

the combined teaching of Fay et al. and Pachet et al. as a whole, 
teach the of the system, However, the combined teaching of Fay et al . and 
Pachet et al. as a whole further teach of, wherein sound sources selected by 
the user from among the sound sources that have been applied through the 
audio input unit are processed into the object sound, and other sound sources 
not selected by the user are processed into background sounds (see , fig.l; 
col. 4 line 20-40 & 10-18; col . 1 line 48-51; col . 2 line 5-10; col. 2 
line 60-67/ object (selected) sound and background (unselected ) sound 
all emit simultaneously) . 



Re claim 4, the system according to claim 1 would have further 
incorporate, wherein the audio editing/producing unit includes: a 
router/audio mixer dividing the sound sources applied in the multi- track 
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format into a plurality of sound source objects and background Sounds 

fig. 4 (402,414,422 ;page 6 [0069] line 9-11 ") ',a scene editor /producer editing 
an audio scene and producing the edited audio scene by using S-D information 
and spatial information of the sound source objects and background sound 
objects divided by the router /audio m±xex{'' fig. 4 (416,418,448) ;page 6 [0076] 
line 5 '10 -particular events/scene is edited/produced "); and a controller 
providing a user interface so that the scene editor /producer edits an audio 
scene and produces the edited audio scene under the control of a user ( " page 
4 [0052] line 5-14; fig. 8 " ) . 

Re claim 5, the system according to claim 1, wherein the audio encoding 
unit includes: a data encoding block encoding each set of data divided into 
background sound objects, sound source objects, and audio scene information 
output from the audio editing/producing unit {'' page 3 [004 0] -encoded sound 
object witti corresponding scene information; fig. 4 (204,206) " ) ; and a 
multiplexer multiplexing object data of the background sound, data of the 
sound sources, and data of the audio scene information encoded by the data 
encoding block into a single signal, and transmitting the 
same ( fig. 4 (440,442) ; page 7 [0081] line 9- 12 -multiplex plurality of 
cliannels (440) to single (442) " ) . 

Re claim 6, the system according to claim 5 would have further 
incorporate the, wherein the data encoding block includes: an audio object 
encoder encoding the sound objects; an audio scene information encoder 
encoding the audio scene information; and a background sound object encoder 
encoding the background sounds (page 2 [0024] line 7-9; page 3 [0035] /sound and 
scene to be encoded) . 
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4. Claim 3 is rejected under 35 U.S.C. 103(a) as being unpatentable over Fay et al.("US 
2002/0161462 A1") and Patchet et al .{ ''2002/0161462 Al") and further in view of 
Leung et al. ("2005/0080616 Al") and Lin et al .{ "20030053680 Al" ) . 

Re claim 3, the combined teaching of Fay et al. and Pachet et al. as a 
whole, disclose the system according to claim 1 with a single channel 
microphone with a single microphone ( " page . 14 f 018 7J line 4 ");, However, Fay et 
al. and Patchet et al . as a whole, fail to disclose the audio input unit 
further includes: a combination of sound source input devices having: a 
stereo microphone with at least two microphones; a dummy head microphone 
whose shape is like a head of a human body; an ambisonic microphone receiving 
the sound sources after dividing them into signals and volume levels, each 
moving with a given trajectory on 3-D X, Y, and Z coordinates; and a multi- 
channel microphone receiving multitrack audio signals;, However, Leung et al. 
disclose of a system for recording three-dimensional auditory scene in which 
the device further include a combination of sound source input devices 
having: a stereo microphone with at least two microphones (" page 1[0004] line 
5-6; page 1 [0008] line 11-14") a dummy head whose shape is like a head of a 
human body (" page i [0008] ") ; an ambisonic microphone receiving the sound 
sources after dividing them into signals and volume levels, each moving with 
a given trajectory on 3-D X, Y, and Z coordinates (" page 1 [0005 ] ") ; and a 
multi-channel microphone receiving multitrack audio signals ( '' page 1 [0008] 
line 3 " ) ; for the purpose of recording sound signals, thus, taking the 
combined teaching of Fay et al. and Patchet et al. and now Leung et al. as a 
whole, it would have been obvious for one of the ordinary skill in the art to 
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modify the teaching of Fay et al. by incorporating the a stereo microphone, a 
dummy head microphone; an ambisonic microphone, a multi- channel microphone 
for the purpose of recording sound signals. 

The combined teaching of Fay et al. and Patchet et al. and Leung et al. 
as a whole, teach the above, However, they fail to teach of the source 
separation/ 3-D information extractor separating the sound sources applied 
from the combination of the sound source input devices by objects, and 
extracting 3-D information. However, Lin et al. teach of a three-dimensional 
sound creation in which there exist the source separation/3 -D information 
extractor separating the sound sources applied from the combination of the 
sound source input devices by objects, and extracting 3-D information ( '^ page 
2 [0015] ; fig. 1 (26, 28) " ) for the purpose of matching sound signal with 
corresponding video object data. Thus, taking the combined teaching of Fay et 
al. and Obuku et al. and Leung et al. and now Lin et al. as a whole, it would 
have been obvious for one of the ordinary skill in the art to modify the 
teaching of Fay et al. and Leung et al. as a whole, by incorporating the a 
three-dimensional sound creation in which there exist the source 
separation/3 -D information extractor separating the sound sources applied 
from the combination of the sound source input devices by objects, and 
extracting 3-D information for the purpose of matching sound signal with 
corresponding video object data as taught by Lin et al. 
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Claim Rejections - 35 USC § 102 

1 . The following is a quotation of the appropriate paragraphs of 35 U.S.C. 102 that 
form the basis for the rejections under this section made in this Office action: 
A person shall be entitled to a patent unless - 



(b) the invention was patented or described in a printed publication in this or a foreign country or in public 
use or on sale in this country, more than one year prior to the date of application for patent In the United 
States. 



3. Claims 7-9 are rejected under 35 U.S.C. 102(b) as being anticipated by Ashour et al. ("US 
6,459,797 61"). 



Re claim 7, Ashour et al. discloses a method of controlling an object- 
based 3-D audio server system comprising (fig. 3-4; col.l line 55-62; col.l 
line 35-41/object is at varying positions) : separat:ing sound source objects 
from among sound sources according to a selection by a user and inputting 3-D 
information for each sound source object separated from the applied sound 
sources ( ''col. I line 5-10/user may designated the position with sources sound 
respectively being originated fig.l (130,150)") ; mixing the sound sources 
other than the separated sound source object into background sounds (col. 3 
line 25-30) ; and forming the sound source objects, the 3-D information , and 
the background sound object into an audio scene (col. 3 line 38-45; 
fig.3/sounds and image represented into scene, and encoding and multiplexing 
the audio scene to transmit the encoded and multiplexed audio signal through 
a medium (col. 1 line 20-30; line 52-58; fig.l) 
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While, Ashour et al . disclose of the above, However, Ashour et al . is 
silent in regard to the wherein the sound sources selected by the user are 
processed into sound source object and other sounds not selected are 
processed into background sounds (fig. 3-4; col. 3 line 36-51; col. 4 line 23- 
36/appropirate instrumental/ spatialized sounds/ all output at speakers) . 

Re claim 8, the method according to claim 7, wherein each of the sound 
source objects further includes 3-D information for a relative sound source 
object grouping the sound source objects that have to be controlled by groups 
(^^fig.3; col . 3 line 40-45; col . 3 line 47-50"). 

Re claim 9, the object -based three dimensional audio terminal system 
comprising: the audio decoding unit demultiplexing and decoding a multiplexed 
audio signal including object sounds, background sounds, and scene 
information applied through a medium; an audio scene synthesizing unit 
selectively the object sounds with the audio scene information decoded by the 
audio decoding unit into a 3-D audio scene under the control of a user; a 
user control unit providing a user interface so as to selectively synthesized 
the audio scene by the audio scene synthesizing unit under the control of the 
user; and audio reproducing unit reproducing the 3-D audio scene synthesized 
by the audio scene synthesizing unit (fig. 1-4; col.l line 52-60, col. 2 line 
5-10, lin2 20-55/fig.l wt ( 130, 100 , 120) /sound demultiplex at (100) and 
synthesized to be output via (170) / all under the user controlled ) . 
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Re claim 15, the system according to claim 9, wherein the user control unit 
include an interface that controls each sound source object and the 
listener's direction and position, and receives the user's control for 
maintaining realism of sound reproduced in a virtual space to transmit a 
control signal to each unit (fig.l wt (120); fig. 3; col. 3 line 35-45). 

Allowable Subject Matter 

2, Claims 10-14 are objected to as being dependent upon a rejected base claim, 
but would be allowable if rewritten in independent form including all of the limitations of 
the base claim and any intervening claims. 

3. claims 16-20 are allowed. 

Re claim 16, Fay et al. (2002/0161462) and Ashour et al. as a whole, fail to disclose of the 
perfomiing motion processing, 3-D sound localization, and 3-D information to modify and apply 
the processed object sounds and 3-D information according to a user selection, and mixing them 
with the background sound. 

Re claims, 19-20 similarly has the limitation of claim 16 above. 

Conclusion 

Any inquiry concerning this communication or earlier communications from the examiner should 
be directed to Disler Paul whose telephpne number is 571-270-1 187. The examiner can normally be 
reached on 7:30-5:00. 
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If attempts to reach the examiner by telephone are unsuccessful, the examiner's supervisor. Chin 
Vivian can be reached on 571-272-7848. The fax phone number for the organization where this 
application or proceeding is assigned is 571-273-8300. 

Information regarding the status of an application may be obtained from the Patent Application 
Information Retrieval (PAIR) system. Status infomnation for published applications may be obtained from 
either Private PAIR or Public PAIR. Status information for unpublished applications is available through 
Private PAIR only. For more information about the PAIR system, see http://pair-direct.uspto.gov. Should 
you have questions on access to the Private PAIR system, contact the Electronic Business Center (EBC) 
at 866-217-9197 (toll-free). If you would like assistance from a USPTO Customer Service Representative 
or access to the automated information system, call 800-786-9199 (IN USA OR CANADA) or 571-272- 
1000. 




